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Abstract We study a connection between two topics: Decoding of Goppa 

codes arising from an algebraic curve, and rank two extensions of certain line 

bundles on the curve. The material about each isolated topic is well known. 

Our contribution is just to expose a connection between them. 
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1. Introduction 

Let C be a curve over a finite field F q . Some years ago V. Goppa showed 
how to produce codes from such a curve (For a survey, see for example Gieseker 
[2j ). In this note we will show how socalled syndrome decoding of (the duals of 
the original ) Goppa codes in an intimate way is connected to the study of rank 
two bundles, that are extensions of the structure sheaf Oc and a certain fixed 
line bundle on C determined by the code in question. In fact, each syndrome 
corresponds to an extension. Moreover syndromes due to correctable error 
vectors always correpond to unstable rank two bundles, that is: semistable 
bundles never correspond to syndromes of correctable errors (Here we call an 
error vector, or simply error, correctable if its Hamming weight is at most 
where d is the designed minimum distance of the code). For correctable 
errors the process of error location is translated into finding a certain quo- 
tient line bundle of minimal degree (or dually: a line subbundle of maximal 
degree) of the rank two bundle defined by the syndrome, and then pick the 
relevant section of this line bundle. If an error is not correctable, there is not 
necessarily a unique such quotient line bundle of minimal degree. We gain 
our insight through a certain projective embedding CcP, with the property 



that the columns of the parity check matrix of the Goppa code in question are 
interpreted as points of the embedded copy of C. We study j-secant (J — 1)- 
planes to C, for j — 1, 2, ... . Using the proper definitions of these geometrical 
objects, we thus obtain a stratification of P, which viewed from one angle is 
a stratification of syndromes, according to how many errors that have to be 
made to obtain the syndrome. Viewed from another angle it is a stratification 
of rank two extensions according to the socalled s-invariant. The geometric 
picture associated to rank two extensions was given very explicitly in Lange 
and Narasimhan [TT], and the basic idea was given already in Atiyah p.]. We 
just remark that the aspects interesting to us carry over to the case of positive 
characteristic. Then we compare with the picture obtained from the space of 
syndrome vectors. So far we have not been able to utilize these geometrical ob- 
servations to make any constructive decoding algorithms. To do so one would 
have to introduce some kind of "extention arithmetic" to perform decoding. 
Some explicit considerations along these lines are made in |3j. For constructive 
decoding algorithms for Goppa codes in general, see the celebrated Feng and 
Rao [6j, or for example Duursma [H], Justesen et al [H3], Pellikaan [l2j, Sko- 
robogatov and Vladut [TJ], or the special issue IEEE Trans, of Info. Theory, 
Vol. 41, No.6, Nov. 1995. 

In Section 2 we recall some basic facts concerning algebraic-geometric 
(Goppa) codes. We also define and make an elementary study of the secant 
varieties that in a natural way turn up in connection with these codes. In 
Section 3 we introduce vector bundle language, and in Theorem 13 .41 we present 
the connection between correctable errors and unstable bundle extensions of 
rank two. 

Remark 1.1. In Goppa [8 J one describes an algorithm for decoding of Goppa 
codes from rational curves. There one assumes first of all that at most t errors 
are made, where t — [(d — l)/2]. Then one assumes that exactly t errors are 
made, and sets up a system of equations to solve the problem of error-location 
given that this extra assumption holds. If the equations yield no solution, one 
assumes that t — 1 errors are made, sets up a new system of equations, and so 
on. In the end one arrives at a point where one finds a solution since the basic 
assumption is that at most t errors are made. The coefficient matrices, set up 
to find the elementary symmetric functions in the parameter values of the error 
locations are Toeplitz, and in particular symmetric. This process is in many 
ways reminiscent of describing complete quadrics through various blowing-ups 
of the space of usual quadrics, which is again a space of symmetric square 
matrices. Hence, in order to generalize to arbitrary curves, one could describe 
some kind of analogy to complete quadrics for curves of positive genus. Such 
a generalization, in terms of an object obtained through various blowing-ups 
of the secant strata of C inside P, has been given in Bertram using vector 
bundle language. It is possible that understanding this or similar objects can 
give new insight into decoding of Goppa codes. 



2. Definitions and basic facts about Goppa codes 

A g-ary code of length n is a subset of the vector space F q , where F q is a 
finite field with q elements. A linear code is a linear subspace of F q . Let F q 
be an algebraic closure of F q . Let C be a curve of genus g defined over these 
fields. Let D and G be divisors on C defined over F q , such that their supports 
are disjoint. Moreover the support of D consists of n distinct points P\, P n 
of degree one, where n = deg(D). For a divisor M defined over F q , denote by 
L(M) the set of the zero element and those elements / of the function field 
F q (C), such that (/) + M > 0. Denote by l(M) the dimension of L(M) as a 
vector space over F q . This dimension is the same as the one obtained if we 
work over F q . 

Following for example van Lint et al [TH] we denote by C(D,G) the code, 
which is the image of L(G) in F q under the map: 

0:/ - (/(Pi), J(Pn)). 

In Pellikaan et al [12], such a code is called a WAG, which is short for weakly 
algebraic- geometric code. Moreover one shows there that all linear codes are 
WAG. By the theorem of Riemann-Roch (which remains valid over finite fields) 
we have: 

dim{C{D, G)) = 1(G) - dim{kerf) = 1{G) - l(G — D) = 

deg(G) + 1 - g + 1{K - G) - 1{G - D). 

As usual K denotes a canonical divisor. Set m = deg(G). A WAG is called a 
SAG (strongly algebraic-geometric code) if the following composite condition 
is fullfilled: 2g-2 < m < n. For a SAG we observe: l(K-G) = l(G-D) = 0, 
and hence: dim C(D, G) = m + 1 — g. By a generator matrix for a code one 
means a (k x n)-matrix, where the rows constitute a base for the code as a 
linear space over F q . A generator matrix for C(D,G) is: 

rA(Pi) A(P 2 ) ■■■ A(P„)1 



L/*(Pi) /fc(P 2 ) ••• fk(Pn) J 

where k = m+ 1 — g, and fi, fk is a basis for L(G), both over F q and over 
F q . By a parity check matrix for a code one means a ((n — k) x n)- coefficient 
matrix of a set of equations cutting out the code as a subspace of F™ and of F q . 
One denotes by C*(D, G) the linear code having the matrix M above as parity 
check matrix. Hence C*(D,G) is the orthogonal complement of C(D,G) and 
vice versa. One also says that the codes are dual to each other. For a WAG 
defined as C(D,G) consider the exact sequence of sheaves on C: 

0^ 0{G-D)^ 0{G)^ 0(G)/0(G — !?)—> 0. 



The long exact cohomology sequence gives: 

-> L(G — D) —> L(G) -> F™ -> 

H\C,0(G- D)) -> H\C,0{G))^ 0. 

Moreover by Serre duality : iP(C, C(G - D)) is dual to L(K + D - G), and 
H l {C, 0(G)) is dual to L(.K" — G). For a SAG the long exact sequence reduces 
to: 

(1) -> -> -> L(K + D-G)* -> 0. 

Here F" can (by Riemann-Roch) be chosen such that G* = K + D — G has 
support disjoint from -D. We can identify the elements of L{K + D — G) 
with differential forms being zero of order the same as order(G) at the points 
of the support of G, and having at most simple poles at the points of the 
support of D, and no poles elsewhere. Hence we see that evaluating elements of 
L(K+D — G) can be interpreted as evaluating residues of the differential forms 
described (after multiplying each value f(Pi) by a non-zero value Resp^rj), 
where K = (77) ). Moreover we have for / in L(G) and u a differential form as 
described: 

(2) = ^Resp-ifu) = Zif(PdRes Pi (u). 

From equations (pQ) and (J2j) we easily conclude that C(D,G*) is code equiv- 
alent to C*(D, G). In the original work by Goppa the code obtained from the 
divisors D and G was C*(D,G), and it was obtained by means of residues of 
differential forms. By a Goppa code we will here simply mean a SAG. One 
easily verifies that C(D, G*) is a SAG if and only if C(D, G) is so. For a SAG 
we see that if {hi, ■ • ■ , hk*} is a basis for L(G*), then a parity check matrix 
for C(D,G) is (essentially, after a trivial equivalence operation): 

~hi{Pi) hi(P 2 ) •■■ hi(P n ) 

M* = 

_MPi) h k ,{P 2 ) ••■ h h .{P n )_ 
where k* = n — k = n — m + g — 1. 

Remark 2.1. We see that the columns of M* represent the points of the 
support of D if C is embedded into P = p«~ m +s-2 = p(H°{C, K + D- G)*) 
by means of sections of L(G*). 

For w x and w 2 in let the Hamming distance <i(w 1 ,w 2 ) denote the 
number of coordinate positions, in which Wi and w 2 differ; it is clearly a 
metric on F™. Let the minimum distance of a code be the minimum Hamming 
distance for any pair of codewords. For a linear code this is easily seen to 
be the minimum number of non-zero coordinates (minimum weight) for any 
non-zero codeword. Denote by di the minimum distance of the code C(D, G). 
If a code word has weight d\, then there is a divisor D 1 with Di < D, and 



deg(-Di) — n — di, such that L(G — Di) ^ 0, Hence m — (n — di) > 0, that 
is: d\ > n — m. We denote by d the integer n — m, which is positive since 
C(D, G) is a SAG. We call d the designed minimum distance. We also see that 
C is embedded into P(H°(C, K + H)*), where d = deg{H) (and H — D — G). 
Denote by t the integer [(d— l)/2]. We also call t the designed error correcting 
capacity. Recall the basic fact: 

Remark 2.2. Let N be the parity check matrix of a code. The minimum 
distance of the code is equal to s if all choices of s — 1 columns of N are 
independent, and some choice of s columns of N are dependent. 

Let x be an element (codeword) of C(D,G) C F™, and assume that x is 
transmitted, and y = x + e is received. The difference e is called the error 
vector. Denote by S(y) the matrix product M*y. Clearly S(y) is a vector 
in F£* , and S(y) = 5(e) = if and only if y is itself a codeword. S(y) is 
called the syndrome vector of y. We can also interpret S(y) as a point of 
P = P fc _1 = ~p d+9 ~ 2 . For each integer a, let the (Hamming) a-ball centered 
at x be the set of those y, such that d(x, y) < a. The following is immediate 
from the triangle equality: 

Remark 2.3. The restriction of the map S: F™ — > F^* to any t-ball is 
injective. 

Secant varieties 

Now we view C as any curve defined over the algebraic closure F q , and let 
C be embedded in some projective space over this field. Let A be an effective 
divisor on C, possibly with repeated points. Let Cj be the j'th symmetric 
product of C, for j=l,2,... . 

Definition 2.4. (a) We denote by Span(A) the intersection of all hyper- 
planes Ti, such that we have: Ej/(Qj, Cfl TL)Qi > A (Here I(Q, V\C\ V 2 ) 
denotes the usual Bezout intersection number of two varieties of com- 
plementary dimension at a point Q). 

(b) We say that C is k-spanned if dim (Span (A)) — j — 1, for all A with 
deg(A) = j , and j < k + 1. 

(c) We set SeCj(C) = USpan(A), where the union is taken over all A in 

(d) For a point P in projective space we set h(P) = h if P is contained in 
See h {C)-Sec h - X {C). 

Proposition 2.5. Let C be the curve treated in Section 2, defined over F q and 
embedded into 'p d+9 ~ 2 by the linear system K + D — G as described. Then we 
have: 

(a) C is (d — 2) -spanned. In particular C is smoothly embedded if d> 3. 

(b) If h(P) — h < [(d — l)/2] = t, then there is a unique effective divisor 
A with degree at most h, such that P is contained in Span(A). 



Proof, (a) An easy application of Riemann-Roch. Let A be a divisor of degree 
j < d - 1. Set H = D - G. Then 

l(K + H-A) = 2g-2 + d-j + l-g + l(A-H) = d + g-l-j = l(K+H)-j, 

so A imposes j independent conditions on the linear system. 

(b) Assume P is contained in Span(Ai) fl Span(A 2 ). If the supports of 
the divisors A\ and A 2 are disjoint, then we have: Span(Ai + A 2 ) = the linear 
span of Span(Ai) U Span(A2), so 

dim(Span(Ai + A 2 )) = 1 + dim(Span(Ai)) + dim(Span(A 2 )) — 

dimdSpaniAx) n Span(A 2 )) < 1 + 2{h - 1) - 1 = 2h - 2. 

Hence C is not (2/i — l)-spanned, and thus not (d — 2)-spanned, since h < 
[(d — l)/2]. We leave it to the reader to modify the argument if the supports 
of the two divisors are not disjoint. □ 

By abuse of notation (See Remark 12.11 above) we denote by Pi column nr. 
i of the parity matrix M*. Assume that a codeword x is transmitted, y is 
received, and that the error vector e has weight h with coordinates e±, ■ ■ • , 
in positions i±, ■ ■ ■ ,ih respectively. We have: The syndrome S(y) = S(e) = 
e x P\ + • ■ ■ + ehPh- Interpreting 5*(y) as a point of P, we then see that S(y) is 
contained in Sech(C), and that ^(^(y))^ h. Moreover it is clear that if h < t, 
then h(S(y))= h, and that the "error divisor" P% + • " + Ph is the unique 
divisor A of degree at most h over F q , such that S(y) is contained in Span(A). 
So, error location amounts to finding such a divisor A, given the point S(y). 
A priori we know that this divisor consists of distinct points, all of degree 1 
defined over F q , and that even the errors e%,--- ,eh are in F q . 

3. Vector bundles of rank two on C 

We continue using the notation from Section 2. The following exposition 
is to a great extent taken from Lange and Narasimhan [TT] and Bertram [H]. 
Let Exto c (H, Oc) be the set of isomorphism classes of exact sequences 

(e) : -> O c -> E -> H -»• 0. 

The map Oc — » E is denoted by / and the map E — > H by g. The zero 
element (eo) corresponds to the case of a split exact sequence. Here Oc as usual 
denotes the structure sheaf on C, and H is the fixed line bundle or invertible 
sheaf D — G, see Section 2 (by abuse of notation we do not distinguish between 
the divisor H = D — G, or the invertible sheaf or line bundle, of which the 
divisor corresponds to a global section). The middle term E is a locally free 
sheaf, or vector bundle, of rank 2. Standard cohomology theory and Serre 
duality give: 

Ext 0c (H, O a ) = Ext 0c (O c ,-H) = H\C, —H) = H°(C, K + H)*. 



Hence P(Ext 0c {H, O c )) = P(H°(C, K + H)*) = P. This means that (up to 
isomorphism and a multiplicative factor) the points of our well-known projec- 
tive space P described in Section 2 are identified with extensions as described. 

Definition 3.1. Let E be a rank two vector bundle on C . 

(a) Denote by s(E) the integer 

deg(E) — 2max(deg(L)) = 2min(deg(M)) — deg(E), 

where the maximum is taken over all line subbundles L of E and the 
minimum is taken over the quotient line bundles M of E. 

(b) E is called stable if s(E) > 0; semistable if s(E) > 0; unstable if 
s(E) < 0. 

(c) For an extension (e) as above we set s((e)) = s(E), where E is the 
middle term. An extension is called stable (semistable, unstable) if the 
middle term E is so. 

The definitions of stable and semistable coincide if deg(E) is odd. For the 

zero element (e D ) we observe: s(E) = —d, and for all non-split extension (e) 
we have d > s((e)) > 2 — d. Moreover, if M is a quotient line bundle of E of 
minimal degree (s + d)/2 > 1, with quotient map h, then the composition of 
h and / is non-zero: 

O c -> E — > M. 

Hence M is isomorphic to Oc(A), for an effective divisor A of degree (s + d)/2. 
This implies again that (identifying (e) with its corresponding point of P) the 
point (e) is contained in the kernel of the map: 

Ext l 0c {H,O c )^ Ext^i^OdA)), 

that is, in the kernel of: 

H°(C, K + H)* — > H°{C,K + H - A)*. 

This observation has many consequences. First we see that the set of points in 
P with s- value 2 — d are precisely those that represent bundles with a quotient 
bundle of type Oc(Q), for some point Q on C. If we assume that d > 3, then 
C is smoothly embedded by Proposition 12.51 Then we can identify C with its 
embedded image in P, and the point Q on C then corresponds to a bundle 
extension with a quotient bundle isomorphic to Oc(Q). Moreover it is clear 
that the observation above is equivalent to: (e) is contained in Span(A). 

Remark 3.2. Arguing in a dual way, we get that if (e) is contained in Span(A), 
then the line bundle corresponding to H — A is a subbundle of E. 

Summing up, we now formulate the following result, which is practically 
identical to Proposition 1.1. of Lange and Narasimhan [TT] (Recall the func- 
tions h and s from P to Z, introduced in Definition I2.4| d.) and Definition 
13.11 (a), respectively). 



Proposition 3.3. Let P be a point of P. Thens(P) = 2h(P)—d, In particular 
P is a unstable point (semistable, stable) if and only if h(P) < d/2 (h(P) > 
d/2, h(P) > d/2). 

We are now able to formulate the main result of the paper: 

Theorem 3.4. Let P = S(y) be the syndrome of a received message using 
the code C(D,G). Then P is the syndrome of some error vector with weight 
at most the designed error correcting capasity t = [(d — l)/2] only if P is an 
unstable point. Moreover in that case the process of error location is reduced 
to finding the error divisor A among global sections of the unique quotient line 
bundle of degree h of the vector bundle E(P) of rank two, appearing as the 
middle term in the extension corresponding to P. 

Proof. This follows directly from Proposition 13.31 and the argument above. □ 

Remark 3.5. The "only if" in the theorem can be replaced by "if and only if" 
if we define the syndrome map over F q . 

Remark 3.6. The definitions of stable, semistable, unstable can be viewed 
as special cases of more general definitions of these concepts in the setting of 
Geometric Invariant Theory (GIT), which again is an essential tool in building 
moduli spaces parametrizing various objects. The spaces arise as quotients 
of various group actions. In order to get quotients with good properties one 
usually has to disregard certain "bad objects", which are the unstable ones. 
In our case the relevant construction is that of SU.2{C), the moduli space of 
isomorphism classes of vector bundles of rank two on C (See Gieseker p. 
51-52). This has dimension 4g — 3 = g + (3g — 3) , where the sum decomposition 
corresponds to g = dim(Jac(C)) degrees of freedom to choose a line bundle 
H, and 3g — 3 = dim(SU2(C, H)) degrees of freedom to choose the rank two 
bundle with determinant H (where iSW 2 (C, H) = modulo space of rank two 
bundles with determinant H). Moreover, for SU 2 (C,H) there are essentially 
only 2 cases; deg(H) odd and deg(H) even, since tensoring a rank two bundle 
with a line bundle gives rise to an isomorphism between two such spaces with 
determinants H with degrees of equal parity. One can show that the natural 
map: 

P - Sec t {C) -> SU 2 {C,H) 
is birational if d = deg(H) — 2g — 1, and that it maps birationally on to the 
^-divisor if d = deg(H) = 2g — 2, where the ^-divisor parametrizes the rank 
two-bundles with a global section. 

One observes that the situation in coding theory in a certain way is comple- 
mentary to that of applying GIT to build SUiiC, H), since the good (syndrome) 
points in coding theory are the bad ones for GIT, and vice versa. On the other 
hand the issue for those who work with moduli spaces is often precisely what 
to do with the unstable points, so the focal point of the theories are still in 
a certain sense overlapping. One is for example interested in blowing up var- 
ious s-negative strata of P to obtain compactifications of P — Sec t (C) and 



SU 2 (C,H) with desired properties. See Bertram [2] and PJ. One could hope 
that insight in such compactifications could be instrumental in understand- 
ing algorithms for decoding of Goppa codes. One can also ask: Is it possible 
that the minimum distance of C(D, G) exceeds d, or weaker: Given a reason- 
able small integer k Q , like 2, 3 or 17; is there a positive limit So, such that if 
s(P) < s = 2/iq — d, then there are at most k divisors A (even over F q ) of 
degree h 0l such that P is contained in Span(A)? For practical purposes this 
would in some situations be almost as good as unique decoding. In vector bun- 
dle language one is then interested maximal sublinebundles of P corresponding 
to divisors of type H — A. The question of such maximal subbundles is the 
main issue in Lange and Narasimhan [11J. 

Example 3.7. Assume g = 0. Then C is mapped into P = P d ~ 2 as a curve 
of degree d — 2, that rational normal curve. It is well-known that on 

C = P 1 all rank two bundles of degree d split as 0(a)© 0{b), with a + b = d. If 
d is odd, we then see the largest possible s-value is —1 = d— 2[(d+l)/2]. Hence 
all rank two-bundles are unstable, corresponding to the fact that P = Sec t (C) 
over F q . 
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